Recurrence relation

In mathematics, a recurrence relation is an equation that recursively defines a sequence: each term of the sequence is defined as a function of the preceding terms.

The term difference equation sometimes (and for the purposes of this article) refers to a specific type of recurrence relation. Note however that "difference equation" is frequently used to refer to any recurrence relation.

An example of a recurrence relation is the logistic map:

x_{n+1} = r x_n (1 - x_n) \,

Some simply defined recurrence relations can have very complex (chaotic) behaviours, and they are a part of the field of mathematics known as nonlinear analysis.

Solving a recurrence relation means obtaining a closed-form solution: a non-recursive function of n.

Contents

Example: Fibonacci numbers

The Fibonacci numbers are defined using the linear recurrence relation

F_{n} = F_{n-1}+F_{n-2} \,

with seed values:

F_0 = 0 \,
F_1 = 1 \,

Explicitly, recurrence yields the equations:

F_2 = F_1 + F_0 \,
F_3 = F_2 + F_1 \,
F_4 = F_3 + F_2 \,

etc.

We obtain the sequence of Fibonacci numbers which begins:

0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, ...

It can be solved by methods described below yielding the closed form expression which involve powers of the two roots of the characteristic polynomial t^2=t+1; the generating function of the sequence is the rational function \frac{t}{1-t-t^2}.

Structure

Linear homogeneous recurrence relations with constant coefficients

An order d linear homogeneous recurrence relation with constant coefficients is an equation of the form:

a_n = c_1a_{n-1} + c_2a_{n-2}+\cdots+c_da_{n-d}  \,

where the d coefficients ci (for all i) are constants.

It can be shown that, in general, an order d linear homogeneous recurrence relation with constant coefficients can be expressed as the sum of d different geometric progressions with different common ratios. An exception occurs when the equation that normally determines the common ratios of those geometric progressions fails to have all its roots distinct[1]. Such an expression as a sum of geometric progressions is called a Binet formula.[2] (However, the term Binet formula is also frequently used with specific reference to the expression of the Fibonacci sequence as the sum of two power sequences; see Fibonacci number#Relation to the golden ratio.)

More precisely, this is an infinite list of simultaneous linear equations, one for each n>d−1. A sequence which satisfies a relation of this form is called a linear recursive sequence or LRS. There are d degrees of freedom for LRS, the initial values a_0,\dots,a_{d-1} can be taken to be any values but then the linear recurrence determines the sequence uniquely.

The same coefficients yield the characteristic polynomial (also "auxiliary polynomial")

p(t)= t^d - c_1t^{d-1} - c_2t^{d-2}-\cdots-c_{d}\,

whose d roots play a crucial role in finding and understanding the sequences satisfying the recurrence.

Rational generating function

Linear recursive sequences are precisely the sequences whose generating function is a rational function: the denominator is the auxiliary polynomial (up to a transform), and the numerator is obtained from the seed values.

The simplest cases are periodic sequences, a_n = a_{n-d}, n\geq d, which have sequence a_0,a_1,\dots,a_{d-1},a_0,\dots and generating function a sum of geometric series:


\begin{align}
& \frac{a_0 + a_1 x^1 + \cdots + a_{d-1}x^{d-1}}{1-x^d} \\[6pt]
& = \left(a_0 + a_1 x^1 + \cdots + a_{d-1}x^{d-1}\right) \\[3pt]
& {} \quad + \left(a_0 + a_1 x^1 + \cdots + a_{d-1}x^{d-1}\right)x^d \\[3pt]
& {} \quad + \left(a_0 + a_1 x^1 + \cdots + a_{d-1}x^{d-1}\right)x^{2d} + \cdots.
\end{align}

More generally, given the recurrence relation:

a_n = c_1a_{n-1} + c_2a_{n-2}+\cdots+c_da_{n-d}  \,

with generating function

a_0 + a_1x^1 + a_2 x^2 + \cdots,

the series is annihilated at a_d and above by the polynomial:

1- c_1x^1 - c_2 x^2 - \cdots - c_dx^d. \,

That is, multiplying the generating function by the polynomial yields

b_n = a_n - c_1 a_{n-1} - c_2 a_{n-2} - \cdots - c_d a_{n-d} \,

as the coefficient on x^n, which vanishes (by the recurrence relation) for n \geq d. Thus

(a_0 + a_1x^1 + a_2 x^2 + \cdots {} ) (1- c_1x^1 - c_2 x^2 - \cdots - c_dx^d) = (b_0 + b_1x^1 + b_2 x^2 + \cdots + b_{d-1} x^{d-1})

so dividing yields

a_0 + a_1x^1 + a_2 x^2 + \cdots  =
\frac{b_0 + b_1x^1 + b_2 x^2 + \cdots + b_{d-1} x^{d-1}}{1- c_1x^1 - c_2 x^2 - \cdots - c_dx^d},

expressing the generating function as a rational function.

The denominator is x^d p\left(\tfrac 1 d\right), a transform of the auxiliary polynomial (equivalently, reversing the order of coefficients); one could also use any multiple of this, but this normalization is chosen both because of the simple relation to the auxiliary polynomial, and so that b_0 = a_0.

Relationship to difference equations narrowly defined

Given an ordered sequence \left\{a_n\right\}_{n=1}^\infty of real numbers: the first difference \Delta(a_n)\, is defined as

\Delta(a_n) = a_{n+1} - a_n\,.

The second difference \Delta^2(a_n)\, is defined as

\Delta^2(a_n) = \Delta(a_{n+1}) - \Delta(a_n)\,,

which can be simplified to

\Delta^2(a_n) = a_{n+2} - 2a_{n+1} + a_n\,.

More generally: the kth difference of the sequence a_n\, is written as \Delta^k(a_n)\, is defined recursively as

\Delta^k(a_n) = \Delta^{k-1}(a_{n+1}) - \Delta^{k-1}(a_n)\,.

The more restrictive definition of difference equation is an equation composed of an and its kth differences. (A widely used broader definition treats "difference equation" as synonymous with "recurrence relation". See for example rational difference equation and matrix difference equation.)

Linear recurrence relations are difference equations, and conversely; since this is a simple and common form of recurrence, some authors use the two terms interchangeably. For example, the difference equation

3\Delta^2(a_n) + 2\Delta(a_n) + 7a_n = 0\,

is equivalent to the recurrence relation

3a_{n+2} = 4a_{n+1} - 12a_n\,

Thus one can solve many recurrence relations by rephrasing them as difference equations, and then solving the difference equation, analogously to how one solves ordinary differential equations; but it would be very difficult to make the Ackermann numbers into a difference equation, much less points on the solution to a differential equation.

See time scale calculus for a unification of the theory of difference equations with that of differential equations.

Summation equations relate to difference equations as integral equations relate to differential equations.

From sequences to grids

Single-variable or one-dimensional recurrence relations are about sequences (i.e. functions defined on one-dimensional grids). Multi-variable or n-dimensional recurrence relations are about n-dimensional grids. Functions defined on n-grids can also be studied with partial difference equations.[3]

Solving

General methods

For order 1 no theory is needed; the recurrence

a_{n}=r a_{n-1} \,

has the obvious solution  a_n=r^n with a_0=1 and the most general solution is a_n=k  r^n with a_0=k. Note that the characteristic polynomial equated to zero (the characteristic equation) is simply tr=0.

Solutions to such recurrence relations of higher order are found by systematic means, often using the fact that a_n=r^n is a solution for the recurrence exactly when t=r is a root of the characteristic polynomial. This can be approached directly or using generating functions (formal power series) or matrices.

Consider, for example, a recurrence relation of the form

a_{n}=Aa_{n-1}+Ba_{n-2}. \,

When does it have a solution of the same general form as an = rn? Substituting this guess (ansatz) in the recurrence relation, we find that

r^{n}=Ar^{n-1}+Br^{n-2} \, must be true for all n>1.

Dividing through by rn−2, we get that all these equations reduce to the same thing:

r^2=Ar+B, \,
r^2-Ar-B=0, \,

which is the characteristic equation of the recurrence relation. Solve for r to obtain the two roots λ1, λ2: these roots are known as the characteristic roots or eigenvalues of the characteristic equation. Different solutions are obtained depending on the nature of the roots: If these roots are distinct, we have the general solution

a_n = C\lambda_1^n+D\lambda_2^n \,

while if they are identical (when A2 + 4B = 0), we have

a_n = C\lambda^n+Dn\lambda^n \,

This is the most general solution; the two constants C and D can be chosen based on two given initial conditions a0 and a1 to produce a specific solution.

In the case of complex eigenvalues (which also gives rise to complex values for the solution parameters C and D), the use of complex numbers can be eliminated by rewriting the solution in trigonometric form. In this case we can write the eigenvalues as \lambda_1, \lambda_2 = \alpha  \pm \beta i. Then it can be shown that a_n = C\lambda_1^n+D\lambda_2^n \, can be rewritten as[4]:576-585

a_n = 2 M^n \left( E \cos(\theta n) + F \sin(\theta n)\right) = 2 G M^{n} \cos(\theta n - \delta),

where

\begin{array}{lcl}
  M = \sqrt{\alpha^2+\beta^2} & \cos \theta=\tfrac{\alpha}{M} & \sin \theta = \tfrac{\beta}{M} \\
  C,D = E \mp F i & & \\
  G = \sqrt{E^2+F^2} & \cos \delta = \tfrac{E}{G} & \sin \delta = \tfrac{F}{G}
\end{array}

Here E and F (or equivalently, G and \delta) are real constants which depend on the initial conditions.

Note that in all cases—real distinct eigenvalues, real duplicated eigenvalues, and complex conjugate eigenvalues—the equation is stable (that is, the variable a converges to a fixed value (specifically, zero)); if and only if both eigenvalues are smaller than one in absolute value. In this second-order case, this condition on the eigenvalues can be shown[5] to be equivalent to |A| < 1 – B < 2.

The equation in the above example was homogeneous, in that there was no constant term. If one starts with the non-homogeneous recurrence

b_{n}=Ab_{n-1}+Bb_{n-2}+K \,

with constant term K, this can be converted into homogeneous form as follows: The steady state is found by setting bn = bn−1 = bn−2 = b* to obtain

 b^{*} = \frac{K}{1-A-B}. \,

Then the non-homogeneous recurrence can be rewritten in homogeneous form as

[b_{n}-b^{*}]=A[b_{n-1}-b^{*}]+B[b_{n-2}-b^{*}], \,

which can be solved as above.

Note also that the stability condition stated above in terms of eigenvalues for the second-order case remains valid for the general nth-order case: the equation is stable if and only if all eigenvalues of the characteristic equation are less than one in absolute value.

Solving via linear algebra

Given a linearly recursive sequence, let C be the transpose of the companion matrix of its characteristic polynomial, that is

\begin{bmatrix}
0 & 1 & 0 & \cdots & 0\\
0 & 0 & 1 & \cdots & 0\\
\vdots & \vdots & \vdots & \ddots & \vdots\\
0 & 0 & 0 & \cdots & 1\\
-c_0 & -c_1 & -c_2 & \cdots & -c_{d-1}
\end{bmatrix}

where T_n = c_{d-1}T_{n-1} + c_{d-2}T_{n-2} + \cdots + c_{0}T_{n-d}. Call this matrix C. Observe that

\begin{bmatrix}a_n\\
\vdots\\
a_{n+(d-1)}\end{bmatrix}
= C^n\begin{bmatrix}a_0\\
\vdots\\
a_{d-1}\end{bmatrix}

Determine an eigenbasis v_1,...,v_d corresponding to eigenvalues \lambda_1,...,\lambda_d. Then express the seed (the initial conditions of the LRS) as a linear combination of the eigenbasis vectors:

\begin{bmatrix}a_0\\
\vdots\\
a_{d-1}\end{bmatrix} = b_1v_1 + \cdots + b_dv_d

Then it conveniently works out that:

\begin{bmatrix}a_n\\
\vdots\\
a_{n+(d-1)}\end{bmatrix}
= C^n\begin{bmatrix}a_0\\
\vdots\\
a_{d-1}\end{bmatrix}
= C^n(b_1v_1 + \cdots + b_dv_d)
= \lambda_1^nb_1v_1 + \cdots + \lambda_d^n b_dv_d

This description is really no different from general method above, however it is more succinct. It also works nicely for situations like

a_n=a_{n-1}-b_{n-1}\,.
b_n=2a_{n-1}+b_{n-1}\,.

Where there are several linked recurrences .

Solving with z-transforms

Certain difference equations, in particular Linear constant coefficient difference equations, can be solved using z-transforms. The z-transforms are a class of integral transforms that lead to more convenient algebraic manipulations and more straightforward solutions. There are cases in which obtaining a direct solution would be all but impossible, yet solving the problem via a thoughtfully chosen integral transform is straightforward.

Theorem

Given a linear homogeneous recurrence relation with constant coefficients of order d, let p(t) be the characteristic polynomial (also "auxiliary polynomial")

t^d - c_1t^{d-1} - c_2t^{d-2}-\cdots-c_{d} = 0 \,

such that each ci corresponds to each ci in the original recurrence relation (see the general form above). Suppose λ is a root of p(t) having multiplicity r. This is to say that (t−λ)r divides p(t). The following two properties hold:

  1. Each of the r sequences \lambda^n, n\lambda^n, n^2\lambda^n,\dots,n^{r-1}\lambda^n \, satisfies the recurrence relation.
  2. Any sequence satisfying the recurrence relation can be written uniquely as a linear combination of solutions constructed in part 1 as λ varies over all distinct roots of p(t).

As a result of this theorem a linear homogeneous recurrence relation with constant coefficients can be solved in the following manner:

  1. Find the characteristic polynomial p(t).
  2. Find the roots of p(t) counting multiplicity.
  3. Write an as a linear combination of all the roots (counting multiplicity as shown in the theorem above) with unknown coefficients bi.
a_n = (b_1\lambda_1^n + b_2n\lambda_1^n + b_3n^2\lambda_1^n+\cdots+b_{r}n^{r-1}\lambda_1^n)+\cdots+(b_{d-q+1}\lambda_{*}^n + \cdots + b_{d}n^{q-1}\lambda_{*}^n) \,
This is the general solution to the original recurrence relation.
(Note: q is the multiplicity of λ*)
4. Equate each a_0, a_1, a_2,\dots,a_d \, from part 3 (plugging in n = 0,\dots,d \, into the general solution of the recurrence relation) with the known values a_0, a_1, a_2,\dots,a_d \, from the original recurrence relation. Note, however, that the values an from the original recurrence relation used do not have to be contiguous, just d of them are needed (i.e., for an original linear homogeneous recurrence relation of order 3 one could use the values a0, a1, a4). This process will produce a linear system of d equations with d unknowns. Solving these equations for the unknown coefficients b_1, b_2, b_3,\dots,b_d of the general solution and plugging these values back into the general solution will produce the particular solution to the original recurrence relation that fits the original recurrence relation's initial conditions (as well as all subsequent values a_0,a_1,a_2,a_3,\dots of the original recurrence relation).

Interestingly, the method for solving linear differential equations is similar to the method above—the "intelligent guess" (ansatz) for linear differential equations with constant coefficients is e^{\lambda x}\, where λ is a complex number that is determined by substituting the guess into the differential equation.

This is not a coincidence. If you consider the Taylor series of the solution to a linear differential equation:


\sum_{n=0}^{\infin} \frac{f^{(n)}(a)}{n!} (x-a)^{n}

you see that the coefficients of the series are given by the nth derivative of f(x) evaluated at the point a. The differential equation provides a linear difference equation relating these coefficients.

This equivalence can be used to quickly solve for the recurrence relationship for the coefficients in the power series solution of a linear differential equation.

The rule of thumb (for equations in which the polynomial multiplying the first term is non-zero at zero) is that:


y^{[k]} \to  f[n+k]

and more generally


x^m*y^{[k]} \to n(n-1)(n-m+1)f[n+k-m]

Example: The recurrence relationship for the Taylor series coefficients of the equation:

 (x^2 + 3x -4)y^{[3]} -(3x+1)y^{[2]} + 2y = 0\,

is given by

 n(n-1)f[n+1] + 3nf[n+2] -4f[n+3] -3nf[n+1] -f[n+2]+ 2f[n] = 0\,

or

-4f[n+3] +2nf[n+2] + n(n-4)f[n+1] +2f[n] = 0.\,

This example shows how problems generally solved using the power series solution method taught in normal differential equation classes can be solved in a much easier way.

Example: The differential equation

ay'' + by' +cy = 0\,

has solution

 y=e^{ax}.\,

The conversion of the differential equation to a difference equation of the Taylor coefficients is

af[n + 2] + bf[n + 1] + cf[n] = 0\,.

It is easy to see that the nth derivative of eax evaluated at 0 is an

Solving non-homogeneous recurrence relations

If the recurrence is inhomogeneous, a particular solution can be found by the method of undetermined coefficients and the solution is the sum of the solution of the homogeneous and the particular solutions. Another method to solve an inhomogeneous recurrence is the method of symbolic differentiation. For example, consider the following recurrence:

a_{n+1} = a_{n} + 1\,

This is an inhomogeneous recurrence. If we substitute n \mapsto n + 1, we obtain the recurrence

a_{n+2} = a_{n+1} + 1\,

Subtracting the original recurrence from this equation yields

a_{n+2} - a_{n+1} = a_{n+1} - a_{n}\,

or equivalently

a_{n+2} = 2 a_{n+1} - a_{n}\,

This is a homogeneous recurrence which can be solved by the methods explained above. In general, if a linear recurrence has the form

 a_{n+k} = \lambda_{k-1} a_{n+k-1} + \lambda_{k-2} a_{n+k-2} + \cdots + \lambda_1 a_{n+1} + \lambda_0 a_{n} + p(n)

where \lambda_0, \lambda_1, \dots, \lambda_{k-1} are constant coefficients and p(n) is the inhomogeneity, then if p(n) is a polynomial with degree r, then this inhomogeneous recurrence can be reduced to a homogeneous recurrence by applying the method of symbolic differencing r times.

General linear homogeneous recurrence relations

Many linear homogeneous recurrence relations may be solved by means of the generalized hypergeometric series. Special cases of these lead to recurrence relations for the orthogonal polynomials, and many special functions. For example, the solution to

J_{n+1}=\frac{2n}{z}J_n-J_{n-1}

is given by

J_n=J_n(z) \,,

the Bessel function, while

(b-n)M_{n-1} +(2n-b-z)M_n - nM_{n+1}=0 \,

is solved by

M_n=M(n,b;z) \,

the confluent hypergeometric series.

Solving a rational difference equation

Main article: Rational difference equation

A rational difference equation has the form w_{t+1} = \tfrac{aw_t+b}{cw_t+d}. Such an equation can be solved by writing w_t as a nonlinear transformation of another variable x_t which itself evolves linearly. Then standard methods can be used to solve the linear difference equation in x_t.

Stability

Stability of linear higher-order recurrences

The linear recurrence of order d,

a_n = c_1a_{n-1} + c_2a_{n-2}+\dots+c_da_{n-d},  \,

has the characteristic equation

\lambda^{d} - c_1 \lambda^{d-1} - c_2 \lambda^{d-2} - \dots - c_d \lambda^{0} =0.

The recurrence is stable, meaning that the iterates converge asymptotically to a fixed value, if and only if the eigenvalues (i.e., the roots of the characteristic equation), whether real or complex, are all less than unity in absolute value.

Stability of linear first-order matrix recurrences

Main article: Matrix difference equation

In the first-order matrix difference equation

[x_t - x^*] = A[x_{t-1}-x^*]\,

with state vector x and transition matrix A, x converges asymptotically to the steady state vector x* if and only if all eigenvalues of the transition matrix A (whether real or complex) have an absolute value which is less than 1.

Stability of nonlinear first-order recurrences

Consider the nonlinear first-order recurrence

x_n=f(x_{n-1}). \,

This recurrence is locally stable, meaning that it converges to a fixed point x* from points sufficiently close to x*, if and only if the slope of f in the neighborhood of x* is smaller than unity in absolute value: that is,

| f' (x^*) | < 1. \,

Note that a nonlinear recurrence could have multiple fixed points, in which case some fixed points may be locally stable and others locally unstable; for continuous f two adjacent fixed points cannot both be locally stable.

A nonlinear recurrence relation could also have a cycle of period k for k > 1. Such a cycle is stable, meaning that it attracts a set of initial conditions of positive measure, if the composite function g(x)�:= f \circ f \circ \cdot \cdot \cdot \circ f(x) with f appearing k times is locally stable according to the same criterion:

| g' (x^*) | < 1, \,

where x* is any point on the cycle.

In a chaotic recurrence relation, the variable x stays in a bounded region but never converges to a fixed point or an attracting cycle; any fixed points or cycles of the equation are unstable. See also logistic map, dyadic transformation, and tent map.

Relationship to differential equations

When solving an ordinary differential equation numerically, one typically encounters a recurrence relation. For example, when solving the initial value problem

y'(t) = f(t,y(t)), \ \ y(t_0)=y_0,

with Euler's method and a step size h, one calculates the values

y_0=y(t_0), \ \ y_1=y(t_0+h), \ \ y_2=y(t_0+2h), \ \dots

by the recurrence

\, y_{n+1} = y_n + hf(t_n,y_n).

Systems of linear first order differential equations can be discretized exactly analytically using the methods shown in the discretization article.

Applications

Biology

Some of the best-known difference equations have their origins in the attempt to model population dynamics. For example, the Fibonacci numbers were once used as a model for the growth of a rabbit population.

The logistic map is used either directly to model population growth, or as a starting point for more detailed models. In this context, coupled difference equations are often used to model the interaction of two or more populations. For example, the Nicholson-Bailey model for a host-parasite interaction is given by

N_{t+1} = \lambda N_t e^{-aP_t} \,
P_{t+1} = N_t(1-e^{-aP_t}) \,,

with N_t representing the hosts, and P_t the parasites, at time t.

Integrodifference equations are a form of recurrence relation important to spatial ecology. These and other difference equations are particularly suited to modeling univoltine populations.

Digital signal processing

In digital signal processing, recurrence relations can model feedback in a system, where outputs at one time become inputs for future time. They thus arise in infinite impulse response (IIR) digital filters.

For example, the equation for a "feedforward" IIR comb filter of delay T is:

y_t = (1 - \alpha) x_t + \alpha y_{t - T}

Where x_t is the input at time t, y_t is the output at time t, and \alpha controls how much of the delayed signal is fed back into the output. From this we can see that

y_t = (1 - \alpha) x_t + \alpha ((1-\alpha) x_{t-T} + \alpha y_{t - 2T})
y_t = (1 - \alpha) x_t + (\alpha-\alpha^2) x_{t-T}  + \alpha^2 y_{t - 2T}

etc.

Economics

Recurrence relations, especially linear recurrence relations, are used extensively in both theoretical and empirical economics.[6] In particular, in macroeconomics one might develop a model of various broad sectors of the economy (the financial sector, the goods sector, the labor market, etc.) in which some agents' actions depend on lagged variables. The model would then be solved for current values of key variables (interest rate, real GDP, etc.) in terms of exogenous variables and lagged endogenous variables. See also time series analysis.

See also

References

  1. Gilson, Bruce R. (2009). The Fibonacci Sequence and Beyond. CreateSpace. pp. 16 ff.. ISBN 978-1449974114. 
  2. Discussion on Binet's formulas
  3. Partial difference equations, Sui Sun Cheng, CRC Press, 2003, ISBN 9780415298841
  4. Chiang, Alpha C., Fundamental Methods of Mathematical Economics, third edition, McGraw-Hill, 1984.
  5. Papanicolaou, Vassilis, "On the asymptotic stability of a class of linear difference equations," Mathematics Magazine 69(1), February 1996, 34-43.
  6. Sargent, Thomas J., Dynamic Macroeconomic Theory, Harvard University Press, 1987.

External links